
Literature

Eddy SR (2004) What is a hidden Markov model? Nat Biotechnol 22:1315–1316. https://doi.org/10.1038/nbt1004-1315

Gibson DG, Benders GA, Andrews-Pfannkoch C et al (2008) Complete chemical synthesis, assembly, and cloning of a Mycoplasma genitalium genome. Science 319(5867):1215–1220. https://doi.org/10.1126/science.1151721

Güell M, van Noort V, Yus E et al (2009) Transcriptome complexity in a genome-reduced bacterium. Science 326(5957):1268–1271. https://doi.org/10.1126/science.1176951 (PubMed PMID: 19965477)

Kühner S, van Noort V, Betts MJ et al (2009) Proteome organization in a genome-reduced bacterium. Science 326(5957):1235–1240. https://doi.org/10.1126/science.1176343 (PubMed PMID: 19965468; *Here, the genome and proteome of the small bacterium M. pneumoniae are explained in an exemplary manner)

Lander ES (2011) Initial impact of the sequencing of the human genome. Nature 470(7333):187–197. https://doi.org/10.1038/nature09792 (*Here, ten years later, Eric Lander describes what followed from the first human genome sequence)

Lander ES, Linton LM, Birren B et al (2001) Initial sequencing and analysis of the human genome. Nature 409(6822):860–921. https://doi.org/10.1038/35057062 (*The landmark paper giving the first description of the human genome)

Liu SJ, Horlbeck MA, Cho SW et al (2017) CRISPRi-based genome-scale identification of functional long noncoding RNA loci in human cells. Science 355(6320). pii: aah7111. https://doi.org/10.1126/science.aah7111 (*This recent work shows that there are thousands of human lncRNAs [over 200 nucleotides long]; 16,401 lncRNA loci were studied in more detail in seven cell lines. 499 lncRNA loci were identified as essential for cell growth, with 89% being cell type specific. Presumably, there are also thousands of miRNA loci; the ENCODE consortium had evidence for many miRNAs)

D’haeseleer P (2006) What are DNA sequence motifs? Nat Biotechnol 24:423–425. https://doi.org/10.1038/nbt0406-423

Stormo GD, Zhao Y (2010) Determining the specificity of protein–DNA interactions. Nat Rev Genet 11:751–760. https://doi.org/10.1038/nrg2845

Stormo GD (2013) Modeling the specificity of protein–DNA interactions. Quant Biol 1(2):115–130. https://doi.org/10.1007/s40484-013-0012-4

Telenti A, Pierce LC, Biggs WH et al (2016) Deep sequencing of 10,000 human genomes. Proc Natl Acad Sci U S A 113(42):11901–11906. https://www.ncbi.nlm.nih.gov/pubmed/27702888 (PubMed PMID: 27702888; PubMed Central PMCID: PMC5081584; *This paper shows the current state of human genome sequencing: by now, even 10,000 genomes can be compared on an industrial scale, for instance for conserved single nucleotide polymorphisms)

The ENCODE Project Consortium (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74. https://doi.org/10.1038/nature11247 (*The ENCODE consortium has created an encyclopedia of all DNA elements in the human genome, which is about 100 times more accurate than the initial sequencing. It also showed that about half of the human genome is actively transcribed, much more than the protein-coding genes [30% of the genome; coding regions only 3%])

Venter JC, Adams MD, Myers EW et al (2001) The sequence of the human genome. Science 291(5507):1304–1351. Erratum in: Science 292(5523):1838 (PubMed PMID: 11181995; *This is the famous paper on the human genome sequencing that J. Craig Venter and his little armada of sequencing robots accomplished in just three years)

3  Genomes: Molecular Maps of Living Organisms